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criterion for use in placement of mentally retarded children, 127 
subjects were involved. The California Achievement Test (CAT) was 
used as a criterion measure, and the Primary Mental Abilities test 
(OMA) and the Slosson Intelligence Test (SIT) were used to determine 
the correlation of the scores with the criterion measure. The results 
indicate* correlations of .68 between the PliA and the CAT and .62 
between the SIT and the CAT. The intercorrelation of the PM A and SIT 
was . 67. Ability subtests were analyzed to determine their 
correlations to the criterion, ^he conclusion was that when six or 
more children are to be tested, less time if needed and no concurrent 
validity is lost by using the PEA rather than the individual test, 
the SI'*. Tables of results are included. (Author/JM) 
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A COMPARISON Or AN ACHifcVfcMENT BATTERY WITH TWO TESTS 
OF ABILITY WITH EDUCABLE MENTAL RETARDATES 



Summa rv 



The purpose of this study was to find the concurrent validity 
of two scholastic aptitude tests when a scholastic achievement 
test was used as a criterion and when the tests were applied to 
children with mental retardation. 

The use of the instrument with the greatest concurrent 
validity will increase the efficiency and accuracy with which 
mentally retarded children can be placed in special classes. 

Children (184) in sixteen elementary special classes in 
Leon County, Florida were administered a criterion measure, 
the California Achievement Test. The Primary Mental Abilities 
test and the Slosson intelligence Test were also administered 
to determine the correlation of the scores with the criterion 
measure, and intercorrelations between tests. 

Attrition, due to absenteeism and an inability of the sub- 
jects to understand the verbal directions of the CAT reduced the 
number of subjects to 127. 

Analysis of the data using a stepwise multiple regression 
computer program indicated that the PMA correlated r=0.676 and 
the SIT correlated r=0.6l8 with the criterion measures. These 
correlations were not significantly different at the .05 level 
of confidence. The intercorrelation of the PMA and SIT was 
r=0.665. A secondary analysis determined which ability subtest 
or total correlated most highly with each of the nine subscores 
of the criterion variable. The means and standard deviations 
of all subscores were found. 

It was concluded that when six or more children are to be 
tested an economy of test administrat ion time may be effected 
without loss of concurrent validity, by employing the group 
test (PMA) rather than the individual test (SIT). 



A COMPARISON OF AN ACHIEVEMENT BATTERY WITH TWO TESTS 
OF ABILITY WITH EDUCABLE MENTAL RETARDATES 



I nt roduct ion 



According to Heber's definition, to be classified as mentally 
retarded, a child must display subnormal intellectual function*" 
ing, which is usually measured by a standardized intelligence 
test, and maladaptive behavior of which school achievement, is a 
primary criterion in school age children. Customarily school 
achievement is measured by using a standardized achievement test. 

The intelligence test was devised as a predictor of school achieve- 
ment and must therefore be classified as a test of scholastic 
aptitude rather than a test of intelligence. Consequently iri 
order to classify a child as mentally retarded, an aptitude 
and a post-facto test of school success must coincide. School 
achievement tests and scholastic aptitude tests should, there- 
fore, correlate highly with one another when they are applied 
to children with mental retardation. 

The purpose of this study was to find the concurrent 
validity of two scholastic aptitude tests and a scholastic, 
achievement test when the tests were applied to children with 
mental retardation. The intelligence test having the higher 
correlation with an achievement test would be the better ap- 
titude test for identifying such children. 

Mueller (1965) investigated the empirical validity of six 
psychometric tests, as predictors of learning ability: The 

Stanford Binet Intelligence Scale (S-B, Terman & Merrill, I960); 
the Illinois Test of Psychol ingu ist ic Abilities (1TPA, McCarthy 
& Kirk, 1961); the Peabody Picture Vocabulary Test (PPVT, -Dunn, 
1959); the Pictorial Test of Intelligence (PTI , French, 1963); 
the Colored Progressive Matricies (CPM, Raven, 19^7); and the 
Primary Mental Abilities Test (PMA, Thurstone and Thurstone, 

1962). He found, "The picture which emerged^in these compari- 
sons suggests that the PMAT, S-B, PTI, and I * PA are the most 
valid of the predictor tests under consideration in predicting 
learning ability in young mental retardates, ... the PMAT was 
superior to both the Binet and PPVT in predicting reading and 
arithmetic achievement." (Mueller, 1965). 

* The PMA (Thurstone & Thurstone, 1962) is a teacher-administered 
group screening instrument of mental maturity. In view of the 
correlation (0.79) reported by Mueller for the PMA with his learning 
criteria, and reading and arithmetic achievement, investigation 
of the concurrent validity of the PMA wnen compared with other 
teacher-administered instruments especially the more time con- 
suming individual tests, should be conducted. 



One such screening instrument which seems to be gaining con- 
siderable prominence as a teacher-administered test, is the Slos- 
son Intelligence Test (SIT, Slosson, 1 963)"-* Admission to special 
education classes in some states is determined on the basis of an 
individual psychological examination by a qualified examiner. The 
SIT meets this requirement in that it is an individually adminis- 
tered psychological test. The manual lists teachers, social work- 
ers, guidance counselors and others as qualified examiners. How- 
ever, the SIT has been used on the basis of its face validity a- 
lone; e.g., there is currently no statement of the SIT validity cr 
statistical reliability in the literature 1 (Baumeister, 1967). To 
be valid for the purpose of admitting to special classes,, it is 
necessary that the SIT, as does the PMA, correlate highly with a 
scholastic achievement test. In this study the PMA form K-l was 
found to be appropriate for the subjects. The SIT is obtainable 
in only one form. 

An achievement test which has been used extensively and which 
lends itself well to educational planning, due to the diagnostic 
analysis page included in the test protocol, is the California A- 
chievement Test (CAT, Tiegs & Clark, 1957 with 1963 norms). The 
CAT has reported test retest reliability coefficients of from .97 
to .98 on the three lower ranges of the battery. The lower primary 
Form W, grades one and two, was used. 

The scores obtained on the CAT measure served as an accepted 
contemporary criterion of performance. The concurrent validity cf 
the SIT was determined by correlating the scores obtained oh the 
SIT with those obtained on the criterion measure. The concurrent 
validity of the PMA obtained in a like fashion. The intercorrela- 
tions between the PMA and the SIT scores were also determined. 

Usually a school teacher cannot administer an S-B test in -a 
valid manner, which is unfortunate because teachers make the 
greatest use of the intelligence quotient and it is in the S-B . 
that psychomet rists usually have the most faith. Mueller’s study 
may be seen as part of a real, though unorganized and perhaps not 
always conscious, nationwide program to determine which teacher ad- 
ministered IQ. test is most deserving of the teacher's faith. The 
search is based on the logic that if children benefit f rom’ teaqher- 
gathered IQ. scores, then the benefit should be a positive function 
of the validity and reliability of the tests employed. Mueller*;s 
study indicates that the PMA has high concurrent validity with his 
criteria. 

The present study extends Mueller's work by comparing the con- 
current val idity, using a more commonly found criterion# of the SIT 
and the PMA. The SIT can.be character ized as a brief $-8 type, of 
teacher administered test. 

•a. 

Method . ; 

The experimental subjects, cons isted of the total number (18A) of 
children who were enrolled in the sixteen elementary public school 
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special classes for the educable mentally retarded in Leon County, 
Florida, all experimental testing was accomplished in two weeks 
per class. Those subjects not present for all testing sessions 
were excluded from the sample. To control for systematic bias 
due to non- random absenteeism, no testing was done during weeks 
that included national or religious holidays which would have 
tended to systematically exlude a segment of the sample. Attri- 
tion was high due to absenteeism and a seeming inability for some 
children to understand the verbal instructions necessary for the 
administration of the CAT. 

Administration of the test instruments was accomplished by 
trained student examiners. All of the tests were administered 
to all of the subjects. The group tests were administered to 
each class in the manner described in the manuals. The SIT was 
individually administered in the standard manner. The order of 
test administration was randomized. The protocols were scored 
and recorded by the examiners and then re-scored by the inves- 
tigator to assure accuracy and consistency. 

Complete data was collected from 146 subjects. Inspection 
of the protocols by an expert revealed that some protocols had to 
be judged invalid. This judgment was usually made on the basis 
that the child must have misunderstood the task. When all pro- 
tocols were checked, complete and valid data were available for 
127 subjects. These were subjected to statistical analysis. 

Analysis and Findings 

All scores were derived as age equivalents from norms pub- 
lished in their respective manuals. The data from all tests and 
subtests, where necessary, were converted to age equivalents in 
months by the rule of five (Dunn, 1964, p.60). Thus the third 
month of grade one yielded an achievement age of 6 years 3 months 
or 75 months. These data were analysed in a large computer using 
Efroymson's program for stepwise multiple regression. Table 1 
displays tho means and standard deviations for subtests and totals 
of all three tests. ..The tab]e indicates that a ratio IQ. obtained 
from the formula X 100= • Q., yieias a mean IQ of 71 for the total 
population when STosson MA is used, and 65 IQ. when PMA scords are 
used. On only one collection of data, the Perceptual Speed sub- 
test of the PMA, did the subjects perform better that would be 
anticipated by their SIT score and its grade equivalent by the rule 
of five. On all other subtests and total scores, the performance 
was below this expectation. In this study the total score of the 
PMA was a more accurate predictor of school achievement than was 
the SIT even though both tended to overestimate the achievement. 

The total battery mean score for the CAT was 78 months, the PMA 
total was 83 months and the SIT total was 91 months. These data 
indicate that in this particular study moieaccurate predictions 
of CAT scores could have been obtained by us mg the PMA in a group 
testing situation than by using the individually administered SIT. 
However, these results can not be generalized beyond this study 
because the correlations were not significantly differentat the 
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.05 level. 



Table 2 shows the correlations between each ability subtest 
/ and total when the CAT total is used as the criterion variable. 

• The table indicates that the PMA total score correlates r=G.676 
and the SIT total correlates r=0.6l8, with the criterion. A 
; test of the differences of correlations by the method of Walker 
and Lev ( 1 958) revealed that the apparently higher concurrent 
validity of the PMA could have this appearance by chance alone 
(p=^j0) . The most that can be said from this study is that the 
concurrent validity of the two tests- of intelligence do not differ 
by more than can be attributed to the chance factors inherent in 
the tests, the administrations of the tests, and other experimental 
procedures. It does appear to be certain, however, that if more 
than six children are to be examined, an economy of administration 
time with no sacrifice of validity can be obtained through the use 
v of the group PMA. 

Table 3 shows the partitioning of -the total sum of squares 
---into portions attributable to each of the contributing factors. 

It verifies that in the present study the PMA total score was the 
best predictor of the criterion variable,, and that' adding knowledge 
of the SIT score yielded an increase, in r square bf only .05. The 
•intercorrelation of the PMA and SJT is r = 0.665 (Tdble 2).' It must 
.therefore be concluded that the practice- of non-select ively admin- 
istering both tests is uneconomical of time in that knowledge of 
-- the .SIT score adds but little increased confidence in obtaining 
a valid measure, when the PMA score is known. 

A secondary concern of this study was to determine which ability 
subtest or total, cprrelated most highly with each of the nine sub- 
scores of the criterion variable. These results are presented in 
.Table 4 which includes drily .those .factors that increase r square 
by more than .01. The tab,] e indicates that the SIT totai score 
was not the best p redactor of. any of the subscores or the total 
of the CAT. The PMA total score was the single best predictor of 
Reading Vocabulary subtest and the total battery. The PMA Ver- 
bal Meaning subtest was the. single best predictor of Reading Com- 
prehension, Total Reading j Mechani cs- of English, Spelling, and 
Total Language subscores of the CAT. The PMA' -Number Facility sub- 
test was the single best predictor of Arithmetic Reasoning,' A r.ith- 
. metic Fundamentals and Arithmetic Total, subscores of the CAT. The 
SIT total score wa$ the second best indicator of these same arith- 
metic subscores. - 

. i - * , 

m I 

Cone 1 us i ons • and Recommend^t ? ons 

r,The experimenter is nq.t satisfied with the results of this 
study. The testing us ing ,-th.e. SIT progressed easily and satis- 
factorily as did the testing .with the PMA, = but with the Southern 
elementary EMR 'c'hi Idren, 60 .pe^. cent, of whom were Negro, admin- 
istering thb CAT, which was th ; e .important \ori ter I on measure, was 
fraught witH' difficult ies., .The youngest one^quarter of the pop-, 
ulation seemed to be working ( ..,on the. ragged edge of thei r^abi lit ies 
when simply comprehending, t be. ye rbal . instruct ions. Attrition selec- 



Means and Standard Deviation of all Subtests and Totals 



Variable 



Mean 



Standard 



PMA 

Verbal Meaning 80 

Perceptual Speed 93 

Number Facility 83 

Spatial Relations 80 

Total Score 83 

SIT 

HA 90 

CA 127 



.94 


14.04 


.02 


16.91 


.91 


14.72 


.19 


20.78 


.20 


14.85 


.77 


15.26 


.42 


21.40 



CAT 

Total Reading Vocabulary 78.33 
Reading Comprehension 74.26 
Total Reading 77.83 
Arithmetic Reasoning 79.06 
Arithmetic Fundamentals 82.31 
Total Arithmetic 79. 5** 
Total Mechanics of English 77.46 
Spelling 76. 5^ 
Total language 76.98 
Total Battery 77.87 



7.22 

8.29 

6.55 

6.62 

12.03 

8.08 

66.77 

12.23 

7.49 

6.84 



Table 2 




Prgrfietnr Subtests and Totals with Criterion^ 



VARIABLE 

NUMBER 

1 

2 

3 

4 

5 

6 

7 

8 



Verbal 

Meaning 

i 


Percep- 

tual 

Speed 

2 


Number 

Facility 

• # 


Spatial 

Rela- 

tions 

4 


PMA 

Total 


CAT 

Total 

6 


SIT 

Total 

7 


CA 

Total 

L_ 


1 

1.009 


.491 


.668 


.685 


.801 


.659 


.590 


.497 




1.000 


.622 


.654 


.754 


.699 


.625 


.131 






1.000 


.670 


.819 


.650 


.652 


.279 








1.000 


.897 


.635 


.551 


.3/5 










1.000 


.676 


.665 


.369 






• -» ■» 






1.000 


.618 


.321 






- 








1.000 


.351 



1.000 




Table 3 

Summary Table of the Stepwise Multiple Regression 



. i 



umber 



Variable 

Entered Removed 


Multiple 
R • 


Increase 
In RSg 


F Va1u< 
Enter or 


.PHA Total 

* 


.6764 


.457 6 


105.45 


> * * * -* 

SIT 


.7129 


.0506 


12.77 


PHA Verbal Meaning 


.7324 


.0282 


7.48 


PMAJSp^ ^ 1 a i.Relati ons 


‘ .7414 


.0133 


3.60 


PMA Total 


.7414; 


-.0000 


.00 


PHA Humber Facility 


.7516 


.0152 


4.28 


PHA Total 


.7554 


.0057 


1.60 


PHA Perceptual Speed 


.7596 


.0063 


1.78 


' CA 


.7602'. : 

S 


.0009 . 


.26 



1 



Table 4 




St«p Variable Entered 

dumber 



Multiple 

R 



Increase F Value 

In RSQ. To Enter 



Criterion: 


Reading Vocabulary 


1 


PMA Total 


2 


Verbal Meaning 


3 


Spatial Relations 


4 


SIT HA 


Criterion: 


Reading Comprehension 


1 


Verbal Meaning 


2 


Spatial Relations 


Criterion: 


Total Reading 


1 


Verbal Meaning 


2 


Spatial Relations 


3 


SIT HA 


Criterion: 


Arithmetic Reasoning 


1 


Number Facility 


2 


SIT MA 


3 


Spatial Relations 


Criterion: 


Arithmetic Fundamentals 


1 


Number Facility * 


2 


SI'f.NA 




PMA Total Score 


Cri terion: 


Total Arithmetic 


1 


Number Facility 


2 


SIT MA . 


3 


Spatial Relations 


Criterion: 


Mechanics of English 


1 


Verbal Meaning 


2 


SIT MA 


3 


Chronological Age 


4 


Perceptual Speed 


Criterion: 


Spelling 


1 


Verbal Meaning 


2 


Number Facility 


Criterion: 


Total Lsngu< ge 


1 


Verbal Meaning 


2 


Number Facility 


3 


SIT HA 


4 


Chronological Age 



.6248 

.6491 

.6693 

.6301 




.3904 

.0310 

.0266 

.0145 


30.0417 

6.6418 

5.9291 

3.2942 


.6233 

.6480 


- w 


.3885 

.0314 


73.4093 

6.7216 


.6300 

.6i.2$ 

.6$47 




.3969 

.0694 

.0163 


* 32.2764 

16.1212 
3. £677 


.7265 

♦it / r 

•7uop 

...052 




.527S 

.0907 

.029S 


139.7362 

29.4973 

10.4031 


.7237 

.7557 

.7663 




.5324 

.0367 

.0161 


142.3161 
11 • 2026 
4.7947 


.7760 

.6223 

.6347 




.6022 

.0740 

.0205 


13$. 2239 
28.3288 

b.3l6l 


.6164 

.6553 

.cols 

.7020 




.3799 

*.0503 

.0340 

.0277 


76.5887 

10.9507 

3.0044 

6.6644 


.5077 

.5265 




.2573 

.0195 


43.4175 

3.3301 


.6064 

.6417 

.6540 

.6636 




.3677 

.0440 

.C160 

.0130 


72.6b$5 

9.2791 

3.4365 

2.8272 



tively biased the population. It weeded out the less verbal 
children with MA’s below seven years and those children with 
the strongest local dialect. These observations, made by the 
principal invest i gator* rather than the actual data itself, has 
lead the experimenter to conclude that the CAT was a poor choice 
of criterion measure for such a population. Too many children 
were working at the bottom of the test apparently because of 
the nature of the instructions and the nature of their language 
behavior. Whenever protocols were suspected of being contaminated 
by this influence, they were discarded. It is suspected that 
the CAT total 'score would have been higher if the above language 
problem had not been encountered, in which case the obtained 
correlations would have altered somewhat. 

Speculation based on the observable trends leads to the 
supposition that the alteration might have been in the direc- 
tion of Mueller’s findings of r^O. 79- 

in spite of the attrition problem it is safe to conclude 
that when-rsix or- more children are to be examined, an economy 
of test administration time can be gained without loss of 
validity by using the group test rather than the individual 
test. Those trends that exist all indicate that the PHA may 
be more valid than the SIT and is not less valid, when used 
with Southern, elementary level, EMR children. 
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